REALM: A Loop Memory Access Optimization Technique with Loop-Carried Data Dependence Analysis for DSP Applications

نویسندگان

  • Meng Wang
  • Zili Shao
چکیده

Reducing memory accesses is particularly important for DSP applications since they are widely used in embedded systems and need to be executed with high performance and low power consumption. In this paper, we focus on optimizing loops which are the most critical sections for DSP applications. We propose a machineindependent intermediate-code-level loop memory access optimization technique, REALM (REdundAnt Load Exploration & Migration), to explore hidden redundant loads and migrate them outside loops based on loopcarried data dependence analysis. In REALM, we first build up a data-flow graph to describe the inter-iteration data dependencies among memory operations. Then we perform code transformation by exploiting these dependencies with registers to hold the values of unnecessary loads and migrating these loads outside loops. Different from the previous work based on data-flow-analysis, our data-flow-graph-based approach is easy to be implemented and more suitable for optimizing loop kernels of DSP applications that have simple control-flow structure. We implement our technique into the IMPACT compiler [24] and conduct experiments using a set of benchmarks from DSPstone [28] on the cycle-accurate VLIW simulator of Trimaran [1]. The experimental results show that our technique significantly reduces the number of memory accesses.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Layout-Conscious Iteration Space Transformation Technique

ÐExploiting locality of references has become extremely important in realizing the potential performance of modern machines with deep memory hierarchies. The data access patterns of programs and the memory layouts of the accessed data sets play a critical role in determining the performance of applications running on these machines. This paper presents a cache locality optimization technique th...

متن کامل

Hierarchical Memory Size Estimation for Loop Transformation and Data Memory Platform Optimization

In today’s embedded systems, the memory hierarchy is rapidly becoming a major bottleneck in terms of power, performance and area, due to the very large amount of (memory related) data need to be transferred and stored (temporarily). This is especially the case for portable multi-media applications systems. These applications are characterized by deep loop nests and multi-dimensional arrays at t...

متن کامل

Design and Implementation of Digital Demodulator for Frequency Modulated CW Radar (RESEARCH NOTE)

Radar Signal Processing has been an interesting area of research for realization of programmable digital signal processor using VLSI design techniques. Digital Signal Processing (DSP) algorithms have been an integral design methodology for implementation of high speed application specific real-time systems especially for high resolution radar. CORDIC algorithm, in recent times, is turned out to...

متن کامل

Loop Distribution and Fusion with Timing and Code Size Optimization for Embedded DSPs

Loop distribution and loop fusion are two effective loop transformation techniques to optimize the execution of the programs in DSP applications. In this paper, we propose a new technique combining loop distribution with direct loop fusion, which will improve the timing performance without jeopardizing the code size. We first develop the loop distribution theorems that state the legality condit...

متن کامل

Analysis of Loop Behavior of Selectable Mode Vocoder (SMV) and Its Impact of Instruction Level Parallelism

The digital signal processor (DSP) industry has been growing steadily over the past few years due to strong demands for digital signal processors in a variety of applications. Among these applications, wireless communication accounts for more than two-thirds of the DSP market today. The Selectable Mode Vocoder (SMV) is a third generation (3G) speech coding technology that provides significant c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007